Dataset statistics
| Number of variables | 23 |
|---|---|
| Number of observations | 129880 |
| Missing cells | 393 |
| Missing cells (%) | < 0.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 22.8 MiB |
| Average record size in memory | 184.0 B |
Variable types
| NUM | 18 |
|---|---|
| CAT | 5 |
Reproduction
| Analysis started | 2021-11-22 20:37:41.664026 |
|---|---|
| Analysis finished | 2021-11-22 20:39:08.578353 |
| Duration | 1 minute and 26.91 seconds |
| Version | pandas-profiling v2.8.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
Arrival Delay in Minutes is highly correlated with Departure Delay in Minutes | High correlation |
Departure Delay in Minutes is highly correlated with Arrival Delay in Minutes | High correlation |
Seat comfort has 4797 (3.7%) zeros | Zeros |
Departure/Arrival time convenient has 6664 (5.1%) zeros | Zeros |
Food and drink has 5945 (4.6%) zeros | Zeros |
Inflight entertainment has 2978 (2.3%) zeros | Zeros |
Departure Delay in Minutes has 73356 (56.5%) zeros | Zeros |
Arrival Delay in Minutes has 72753 (56.0%) zeros | Zeros |
satisfaction
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1014.7 KiB |
| satisfied | |
|---|---|
| dissatisfied |
| Value | Count | Frequency (%) | |
| satisfied | 71087 | 54.7% | |
| dissatisfied | 58793 | 45.3% |
Length
| Max length | 12 |
|---|---|
| Median length | 9 |
| Mean length | 10.35801509 |
| Min length | 9 |
Gender
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1014.7 KiB |
| Female | |
|---|---|
| Male |
| Value | Count | Frequency (%) | |
| Female | 65899 | 50.7% | |
| Male | 63981 | 49.3% |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.014767478 |
| Min length | 4 |
Customer Type
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1014.7 KiB |
| Loyal Customer | |
|---|---|
| disloyal Customer |
| Value | Count | Frequency (%) | |
| Loyal Customer | 106100 | 81.7% | |
| disloyal Customer | 23780 | 18.3% |
Length
| Max length | 17 |
|---|---|
| Median length | 14 |
| Mean length | 14.54927626 |
| Min length | 14 |
Age
Real number (ℝ≥0)
| Distinct count | 75 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 39.42795657530028 |
|---|---|
| Minimum | 7 |
| Maximum | 85 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1014.7 KiB |
Quantile statistics
| Minimum | 7 |
|---|---|
| 5-th percentile | 15 |
| Q1 | 27 |
| median | 40 |
| Q3 | 51 |
| 95-th percentile | 64 |
| Maximum | 85 |
| Range | 78 |
| Interquartile range (IQR) | 24 |
Descriptive statistics
| Standard deviation | 15.11935995 |
|---|---|
| Coefficient of variation (CV) | 0.3834680076 |
| Kurtosis | -0.7191402272 |
| Mean | 39.42795658 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | -0.003606211745 |
| Sum | 5120903 |
| Variance | 228.5950453 |
| Value | Count | Frequency (%) | |
| 39 | 3692 | 2.8% | |
| 25 | 3511 | 2.7% | |
| 40 | 3209 | 2.5% | |
| 44 | 3104 | 2.4% | |
| 41 | 3089 | 2.4% | |
| 42 | 3017 | 2.3% | |
| 43 | 2941 | 2.3% | |
| 45 | 2939 | 2.3% | |
| 23 | 2935 | 2.3% | |
| 22 | 2931 | 2.3% | |
| Other values (65) | 98512 | 75.8% |
| Value | Count | Frequency (%) | |
| 7 | 685 | 0.5% | |
| 8 | 797 | 0.6% | |
| 9 | 859 | 0.7% | |
| 10 | 822 | 0.6% | |
| 11 | 837 | 0.6% |
| Value | Count | Frequency (%) | |
| 85 | 25 | < 0.1% | |
| 80 | 110 | 0.1% | |
| 79 | 52 | < 0.1% | |
| 78 | 44 | < 0.1% | |
| 77 | 106 | 0.1% |
Type of Travel
Categorical
| Distinct count | 2 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1014.7 KiB |
| Business travel | |
|---|---|
| Personal Travel |
| Value | Count | Frequency (%) | |
| Business travel | 89693 | 69.1% | |
| Personal Travel | 40187 | 30.9% |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 15 |
| Min length | 15 |
Class
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1014.7 KiB |
| Business | |
|---|---|
| Eco | |
| Eco Plus | 9411 |
| Value | Count | Frequency (%) | |
| Business | 62160 | 47.9% | |
| Eco | 58309 | 44.9% | |
| Eco Plus | 9411 | 7.2% |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 5.755274099 |
| Min length | 3 |
Flight Distance
Real number (ℝ≥0)
| Distinct count | 5398 |
|---|---|
| Unique (%) | 4.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1981.409054511857 |
|---|---|
| Minimum | 50 |
| Maximum | 6951 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1014.7 KiB |
Quantile statistics
| Minimum | 50 |
|---|---|
| 5-th percentile | 341 |
| Q1 | 1359 |
| median | 1925 |
| Q3 | 2544 |
| 95-th percentile | 3831 |
| Maximum | 6951 |
| Range | 6901 |
| Interquartile range (IQR) | 1185 |
Descriptive statistics
| Standard deviation | 1027.115606 |
|---|---|
| Coefficient of variation (CV) | 0.5183763561 |
| Kurtosis | 0.3643059944 |
| Mean | 1981.409055 |
| Median Absolute Deviation (MAD) | 594 |
| Skewness | 0.4667475219 |
| Sum | 257345408 |
| Variance | 1054966.467 |
| Value | Count | Frequency (%) | |
| 1963 | 92 | 0.1% | |
| 1812 | 88 | 0.1% | |
| 1639 | 87 | 0.1% | |
| 1981 | 86 | 0.1% | |
| 1789 | 86 | 0.1% | |
| 1766 | 83 | 0.1% | |
| 1759 | 83 | 0.1% | |
| 1748 | 82 | 0.1% | |
| 2022 | 81 | 0.1% | |
| 1769 | 81 | 0.1% | |
| Other values (5388) | 129031 | 99.3% |
| Value | Count | Frequency (%) | |
| 50 | 23 | < 0.1% | |
| 51 | 21 | < 0.1% | |
| 52 | 21 | < 0.1% | |
| 53 | 28 | < 0.1% | |
| 54 | 21 | < 0.1% |
| Value | Count | Frequency (%) | |
| 6951 | 1 | < 0.1% | |
| 6950 | 1 | < 0.1% | |
| 6948 | 1 | < 0.1% | |
| 6924 | 1 | < 0.1% | |
| 6907 | 2 | < 0.1% |
| Distinct count | 6 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.838597166615337 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 4797 |
| Zeros (%) | 3.7% |
| Memory size | 1014.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.392983243 |
|---|---|
| Coefficient of variation (CV) | 0.490729456 |
| Kurtosis | -0.9431930858 |
| Mean | 2.838597167 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.09186099833 |
| Sum | 368677 |
| Variance | 1.940402316 |
| Value | Count | Frequency (%) | |
| 3 | 29183 | 22.5% | |
| 2 | 28726 | 22.1% | |
| 4 | 28398 | 21.9% | |
| 1 | 20949 | 16.1% | |
| 5 | 17827 | 13.7% | |
| 0 | 4797 | 3.7% |
| Value | Count | Frequency (%) | |
| 0 | 4797 | 3.7% | |
| 1 | 20949 | 16.1% | |
| 2 | 28726 | 22.1% | |
| 3 | 29183 | 22.5% | |
| 4 | 28398 | 21.9% |
| Value | Count | Frequency (%) | |
| 5 | 17827 | 13.7% | |
| 4 | 28398 | 21.9% | |
| 3 | 29183 | 22.5% | |
| 2 | 28726 | 22.1% | |
| 1 | 20949 | 16.1% |
| Distinct count | 6 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.990645210963967 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 6664 |
| Zeros (%) | 5.1% |
| Memory size | 1014.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.52722437 |
|---|---|
| Coefficient of variation (CV) | 0.5106671847 |
| Kurtosis | -1.089371035 |
| Mean | 2.990645211 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.2522824496 |
| Sum | 388425 |
| Variance | 2.332414277 |
| Value | Count | Frequency (%) | |
| 4 | 29593 | 22.8% | |
| 5 | 26817 | 20.6% | |
| 3 | 23184 | 17.9% | |
| 2 | 22794 | 17.6% | |
| 1 | 20828 | 16.0% | |
| 0 | 6664 | 5.1% |
| Value | Count | Frequency (%) | |
| 0 | 6664 | 5.1% | |
| 1 | 20828 | 16.0% | |
| 2 | 22794 | 17.6% | |
| 3 | 23184 | 17.9% | |
| 4 | 29593 | 22.8% |
| Value | Count | Frequency (%) | |
| 5 | 26817 | 20.6% | |
| 4 | 29593 | 22.8% | |
| 3 | 23184 | 17.9% | |
| 2 | 22794 | 17.6% | |
| 1 | 20828 | 16.0% |
| Distinct count | 6 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.851994148444718 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 5945 |
| Zeros (%) | 4.6% |
| Memory size | 1014.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.443729387 |
|---|---|
| Coefficient of variation (CV) | 0.5062175136 |
| Kurtosis | -0.9867275423 |
| Mean | 2.851994148 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.1168129521 |
| Sum | 370417 |
| Variance | 2.084354542 |
| Value | Count | Frequency (%) | |
| 3 | 28150 | 21.7% | |
| 4 | 27216 | 21.0% | |
| 2 | 27146 | 20.9% | |
| 1 | 21076 | 16.2% | |
| 5 | 20347 | 15.7% | |
| 0 | 5945 | 4.6% |
| Value | Count | Frequency (%) | |
| 0 | 5945 | 4.6% | |
| 1 | 21076 | 16.2% | |
| 2 | 27146 | 20.9% | |
| 3 | 28150 | 21.7% | |
| 4 | 27216 | 21.0% |
| Value | Count | Frequency (%) | |
| 5 | 20347 | 15.7% | |
| 4 | 27216 | 21.0% | |
| 3 | 28150 | 21.7% | |
| 2 | 27146 | 20.9% | |
| 1 | 21076 | 16.2% |
Gate location
Real number (ℝ≥0)
| Distinct count | 6 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.990421927933477 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 2 |
| Zeros (%) | < 0.1% |
| Memory size | 1014.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.305969894 |
|---|---|
| Coefficient of variation (CV) | 0.4367176022 |
| Kurtosis | -1.089822453 |
| Mean | 2.990421928 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.0530638946 |
| Sum | 388396 |
| Variance | 1.705557364 |
| Value | Count | Frequency (%) | |
| 3 | 33546 | 25.8% | |
| 4 | 30088 | 23.2% | |
| 2 | 24518 | 18.9% | |
| 1 | 22565 | 17.4% | |
| 5 | 19161 | 14.8% | |
| 0 | 2 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 2 | < 0.1% | |
| 1 | 22565 | 17.4% | |
| 2 | 24518 | 18.9% | |
| 3 | 33546 | 25.8% | |
| 4 | 30088 | 23.2% |
| Value | Count | Frequency (%) | |
| 5 | 19161 | 14.8% | |
| 4 | 30088 | 23.2% | |
| 3 | 33546 | 25.8% | |
| 2 | 24518 | 18.9% | |
| 1 | 22565 | 17.4% |
Inflight wifi service
Real number (ℝ≥0)
| Distinct count | 6 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.2491299661225748 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 132 |
| Zeros (%) | 0.1% |
| Memory size | 1014.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.31881752 |
|---|---|
| Coefficient of variation (CV) | 0.4058986662 |
| Kurtosis | -1.12144606 |
| Mean | 3.249129966 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.1911228457 |
| Sum | 421997 |
| Variance | 1.73927965 |
| Value | Count | Frequency (%) | |
| 4 | 31560 | 24.3% | |
| 5 | 28830 | 22.2% | |
| 3 | 27602 | 21.3% | |
| 2 | 27045 | 20.8% | |
| 1 | 14711 | 11.3% | |
| 0 | 132 | 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 132 | 0.1% | |
| 1 | 14711 | 11.3% | |
| 2 | 27045 | 20.8% | |
| 3 | 27602 | 21.3% | |
| 4 | 31560 | 24.3% |
| Value | Count | Frequency (%) | |
| 5 | 28830 | 22.2% | |
| 4 | 31560 | 24.3% | |
| 3 | 27602 | 21.3% | |
| 2 | 27045 | 20.8% | |
| 1 | 14711 | 11.3% |
| Distinct count | 6 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.3834770557437635 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 2978 |
| Zeros (%) | 2.3% |
| Memory size | 1014.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.346059144 |
|---|---|
| Coefficient of variation (CV) | 0.3978330937 |
| Kurtosis | -0.5327859187 |
| Mean | 3.383477056 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.6048282202 |
| Sum | 439446 |
| Variance | 1.81187522 |
| Value | Count | Frequency (%) | |
| 4 | 41879 | 32.2% | |
| 5 | 29831 | 23.0% | |
| 3 | 24200 | 18.6% | |
| 2 | 19183 | 14.8% | |
| 1 | 11809 | 9.1% | |
| 0 | 2978 | 2.3% |
| Value | Count | Frequency (%) | |
| 0 | 2978 | 2.3% | |
| 1 | 11809 | 9.1% | |
| 2 | 19183 | 14.8% | |
| 3 | 24200 | 18.6% | |
| 4 | 41879 | 32.2% |
| Value | Count | Frequency (%) | |
| 5 | 29831 | 23.0% | |
| 4 | 41879 | 32.2% | |
| 3 | 24200 | 18.6% | |
| 2 | 19183 | 14.8% | |
| 1 | 11809 | 9.1% |
Online support
Real number (ℝ≥0)
| Distinct count | 6 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.519702802587003 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 1014.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 4 |
| Q3 | 5 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.30651069 |
|---|---|
| Coefficient of variation (CV) | 0.3711991505 |
| Kurtosis | -0.8105718251 |
| Mean | 3.519702803 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.57536498 |
| Sum | 457139 |
| Variance | 1.706970184 |
| Value | Count | Frequency (%) | |
| 4 | 41510 | 32.0% | |
| 5 | 35563 | 27.4% | |
| 3 | 21609 | 16.6% | |
| 2 | 17260 | 13.3% | |
| 1 | 13937 | 10.7% | |
| 0 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 1 | 13937 | 10.7% | |
| 2 | 17260 | 13.3% | |
| 3 | 21609 | 16.6% | |
| 4 | 41510 | 32.0% |
| Value | Count | Frequency (%) | |
| 5 | 35563 | 27.4% | |
| 4 | 41510 | 32.0% | |
| 3 | 21609 | 16.6% | |
| 2 | 17260 | 13.3% | |
| 1 | 13937 | 10.7% |
Ease of Online booking
Real number (ℝ≥0)
| Distinct count | 6 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.4721050200184784 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 18 |
| Zeros (%) | < 0.1% |
| Memory size | 1014.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 5 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.305559648 |
|---|---|
| Coefficient of variation (CV) | 0.3760138707 |
| Kurtosis | -0.9106542561 |
| Mean | 3.47210502 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.4917196477 |
| Sum | 450957 |
| Variance | 1.704485995 |
| Value | Count | Frequency (%) | |
| 4 | 39920 | 30.7% | |
| 5 | 34137 | 26.3% | |
| 3 | 22418 | 17.3% | |
| 2 | 19951 | 15.4% | |
| 1 | 13436 | 10.3% | |
| 0 | 18 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 18 | < 0.1% | |
| 1 | 13436 | 10.3% | |
| 2 | 19951 | 15.4% | |
| 3 | 22418 | 17.3% | |
| 4 | 39920 | 30.7% |
| Value | Count | Frequency (%) | |
| 5 | 34137 | 26.3% | |
| 4 | 39920 | 30.7% | |
| 3 | 22418 | 17.3% | |
| 2 | 19951 | 15.4% | |
| 1 | 13436 | 10.3% |
On-board service
Real number (ℝ≥0)
| Distinct count | 6 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.465075454265476 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 5 |
| Zeros (%) | < 0.1% |
| Memory size | 1014.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 4 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.270835582 |
|---|---|
| Coefficient of variation (CV) | 0.3667555293 |
| Kurtosis | -0.7850230753 |
| Mean | 3.465075454 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.5052698753 |
| Sum | 450044 |
| Variance | 1.615023077 |
| Value | Count | Frequency (%) | |
| 4 | 40675 | 31.3% | |
| 5 | 31724 | 24.4% | |
| 3 | 27037 | 20.8% | |
| 2 | 17174 | 13.2% | |
| 1 | 13265 | 10.2% | |
| 0 | 5 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 5 | < 0.1% | |
| 1 | 13265 | 10.2% | |
| 2 | 17174 | 13.2% | |
| 3 | 27037 | 20.8% | |
| 4 | 40675 | 31.3% |
| Value | Count | Frequency (%) | |
| 5 | 31724 | 24.4% | |
| 4 | 40675 | 31.3% | |
| 3 | 27037 | 20.8% | |
| 2 | 17174 | 13.2% | |
| 1 | 13265 | 10.2% |
Leg room service
Real number (ℝ≥0)
| Distinct count | 6 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.485902371419772 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 444 |
| Zeros (%) | 0.3% |
| Memory size | 1014.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 5 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.292225983 |
|---|---|
| Coefficient of variation (CV) | 0.3707005663 |
| Kurtosis | -0.8413209574 |
| Mean | 3.485902371 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.4964400708 |
| Sum | 452749 |
| Variance | 1.669847991 |
| Value | Count | Frequency (%) | |
| 4 | 39698 | 30.6% | |
| 5 | 34385 | 26.5% | |
| 3 | 22467 | 17.3% | |
| 2 | 21745 | 16.7% | |
| 1 | 11141 | 8.6% | |
| 0 | 444 | 0.3% |
| Value | Count | Frequency (%) | |
| 0 | 444 | 0.3% | |
| 1 | 11141 | 8.6% | |
| 2 | 21745 | 16.7% | |
| 3 | 22467 | 17.3% | |
| 4 | 39698 | 30.6% |
| Value | Count | Frequency (%) | |
| 5 | 34385 | 26.5% | |
| 4 | 39698 | 30.6% | |
| 3 | 22467 | 17.3% | |
| 2 | 21745 | 16.7% | |
| 1 | 11141 | 8.6% |
Baggage handling
Real number (ℝ≥0)
| Distinct count | 5 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.695672928857407 |
|---|---|
| Minimum | 1 |
| Maximum | 5 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 1014.7 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 4 |
| Q3 | 5 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 4 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.156483397 |
|---|---|
| Coefficient of variation (CV) | 0.312929044 |
| Kurtosis | -0.2375393396 |
| Mean | 3.695672929 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.7430365996 |
| Sum | 479994 |
| Variance | 1.337453847 |
| Value | Count | Frequency (%) | |
| 4 | 48240 | 37.1% | |
| 5 | 35748 | 27.5% | |
| 3 | 24485 | 18.9% | |
| 2 | 13432 | 10.3% | |
| 1 | 7975 | 6.1% |
| Value | Count | Frequency (%) | |
| 1 | 7975 | 6.1% | |
| 2 | 13432 | 10.3% | |
| 3 | 24485 | 18.9% | |
| 4 | 48240 | 37.1% | |
| 5 | 35748 | 27.5% |
| Value | Count | Frequency (%) | |
| 5 | 35748 | 27.5% | |
| 4 | 48240 | 37.1% | |
| 3 | 24485 | 18.9% | |
| 2 | 13432 | 10.3% | |
| 1 | 7975 | 6.1% |
Checkin service
Real number (ℝ≥0)
| Distinct count | 6 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.3408068986757007 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 1014.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 3 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.260582285 |
|---|---|
| Coefficient of variation (CV) | 0.3773286883 |
| Kurtosis | -0.7935110538 |
| Mean | 3.340806899 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.3924424812 |
| Sum | 433904 |
| Variance | 1.589067697 |
| Value | Count | Frequency (%) | |
| 4 | 36481 | 28.1% | |
| 3 | 35538 | 27.4% | |
| 5 | 27005 | 20.8% | |
| 2 | 15486 | 11.9% | |
| 1 | 15369 | 11.8% | |
| 0 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 1 | 15369 | 11.8% | |
| 2 | 15486 | 11.9% | |
| 3 | 35538 | 27.4% | |
| 4 | 36481 | 28.1% |
| Value | Count | Frequency (%) | |
| 5 | 27005 | 20.8% | |
| 4 | 36481 | 28.1% | |
| 3 | 35538 | 27.4% | |
| 2 | 15486 | 11.9% | |
| 1 | 15369 | 11.8% |
Cleanliness
Real number (ℝ≥0)
| Distinct count | 6 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.7057591623036648 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 5 |
| Zeros (%) | < 0.1% |
| Memory size | 1014.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 4 |
| Q3 | 5 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.151773912 |
|---|---|
| Coefficient of variation (CV) | 0.3108064667 |
| Kurtosis | -0.2088886554 |
| Mean | 3.705759162 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.7560006872 |
| Sum | 481304 |
| Variance | 1.326583144 |
| Value | Count | Frequency (%) | |
| 4 | 48795 | 37.6% | |
| 5 | 35916 | 27.7% | |
| 3 | 23984 | 18.5% | |
| 2 | 13412 | 10.3% | |
| 1 | 7768 | 6.0% | |
| 0 | 5 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 5 | < 0.1% | |
| 1 | 7768 | 6.0% | |
| 2 | 13412 | 10.3% | |
| 3 | 23984 | 18.5% | |
| 4 | 48795 | 37.6% |
| Value | Count | Frequency (%) | |
| 5 | 35916 | 27.7% | |
| 4 | 48795 | 37.6% | |
| 3 | 23984 | 18.5% | |
| 2 | 13412 | 10.3% | |
| 1 | 7768 | 6.0% |
Online boarding
Real number (ℝ≥0)
| Distinct count | 6 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.3525870033877427 |
|---|---|
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 14 |
| Zeros (%) | < 0.1% |
| Memory size | 1014.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 4 |
| 95-th percentile | 5 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.298714502 |
|---|---|
| Coefficient of variation (CV) | 0.387376823 |
| Kurtosis | -0.9380499192 |
| Mean | 3.352587003 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | -0.3664956098 |
| Sum | 435434 |
| Variance | 1.686659358 |
| Value | Count | Frequency (%) | |
| 4 | 35181 | 27.1% | |
| 3 | 30780 | 23.7% | |
| 5 | 29973 | 23.1% | |
| 2 | 18573 | 14.3% | |
| 1 | 15359 | 11.8% | |
| 0 | 14 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 14 | < 0.1% | |
| 1 | 15359 | 11.8% | |
| 2 | 18573 | 14.3% | |
| 3 | 30780 | 23.7% | |
| 4 | 35181 | 27.1% |
| Value | Count | Frequency (%) | |
| 5 | 29973 | 23.1% | |
| 4 | 35181 | 27.1% | |
| 3 | 30780 | 23.7% | |
| 2 | 18573 | 14.3% | |
| 1 | 15359 | 11.8% |
| Distinct count | 466 |
|---|---|
| Unique (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 14.713712657838004 |
|---|---|
| Minimum | 0 |
| Maximum | 1592 |
| Zeros | 73356 |
| Zeros (%) | 56.5% |
| Memory size | 1014.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 12 |
| 95-th percentile | 77 |
| Maximum | 1592 |
| Range | 1592 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 38.07112622 |
|---|---|
| Coefficient of variation (CV) | 2.587458862 |
| Kurtosis | 100.6445463 |
| Mean | 14.71371266 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.82198031 |
| Sum | 1911017 |
| Variance | 1449.410651 |
| Value | Count | Frequency (%) | |
| 0 | 73356 | 56.5% | |
| 1 | 3682 | 2.8% | |
| 2 | 2855 | 2.2% | |
| 3 | 2535 | 2.0% | |
| 4 | 2309 | 1.8% | |
| 5 | 2136 | 1.6% | |
| 6 | 1884 | 1.5% | |
| 7 | 1748 | 1.3% | |
| 8 | 1618 | 1.2% | |
| 9 | 1552 | 1.2% | |
| Other values (456) | 36205 | 27.9% |
| Value | Count | Frequency (%) | |
| 0 | 73356 | 56.5% | |
| 1 | 3682 | 2.8% | |
| 2 | 2855 | 2.2% | |
| 3 | 2535 | 2.0% | |
| 4 | 2309 | 1.8% |
| Value | Count | Frequency (%) | |
| 1592 | 1 | < 0.1% | |
| 1305 | 1 | < 0.1% | |
| 1128 | 1 | < 0.1% | |
| 1017 | 1 | < 0.1% | |
| 978 | 1 | < 0.1% |
| Distinct count | 472 |
|---|---|
| Unique (%) | 0.4% |
| Missing | 393 |
| Missing (%) | 0.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 15.09112883918849 |
|---|---|
| Minimum | 0.0 |
| Maximum | 1584.0 |
| Zeros | 72753 |
| Zeros (%) | 56.0% |
| Memory size | 1014.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 13 |
| 95-th percentile | 78 |
| Maximum | 1584 |
| Range | 1584 |
| Interquartile range (IQR) | 13 |
Descriptive statistics
| Standard deviation | 38.46565024 |
|---|---|
| Coefficient of variation (CV) | 2.548891514 |
| Kurtosis | 95.11711419 |
| Mean | 15.09112884 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 6.670124611 |
| Sum | 1954105 |
| Variance | 1479.606248 |
| Value | Count | Frequency (%) | |
| 0 | 72753 | 56.0% | |
| 1 | 2747 | 2.1% | |
| 2 | 2587 | 2.0% | |
| 3 | 2442 | 1.9% | |
| 4 | 2373 | 1.8% | |
| 5 | 2083 | 1.6% | |
| 6 | 2021 | 1.6% | |
| 7 | 1794 | 1.4% | |
| 8 | 1751 | 1.3% | |
| 9 | 1566 | 1.2% | |
| Other values (462) | 37370 | 28.8% |
| Value | Count | Frequency (%) | |
| 0 | 72753 | 56.0% | |
| 1 | 2747 | 2.1% | |
| 2 | 2587 | 2.0% | |
| 3 | 2442 | 1.9% | |
| 4 | 2373 | 1.8% |
| Value | Count | Frequency (%) | |
| 1584 | 1 | < 0.1% | |
| 1280 | 1 | < 0.1% | |
| 1115 | 1 | < 0.1% | |
| 1011 | 1 | < 0.1% | |
| 970 | 1 | < 0.1% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| satisfaction | Gender | Customer Type | Age | Type of Travel | Class | Flight Distance | Seat comfort | Departure/Arrival time convenient | Food and drink | Gate location | Inflight wifi service | Inflight entertainment | Online support | Ease of Online booking | On-board service | Leg room service | Baggage handling | Checkin service | Cleanliness | Online boarding | Departure Delay in Minutes | Arrival Delay in Minutes | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | satisfied | Female | Loyal Customer | 65 | Personal Travel | Eco | 265 | 0 | 0 | 0 | 2 | 2 | 4 | 2 | 3 | 3 | 0 | 3 | 5 | 3 | 2 | 0 | 0.0 |
| 1 | satisfied | Male | Loyal Customer | 47 | Personal Travel | Business | 2464 | 0 | 0 | 0 | 3 | 0 | 2 | 2 | 3 | 4 | 4 | 4 | 2 | 3 | 2 | 310 | 305.0 |
| 2 | satisfied | Female | Loyal Customer | 15 | Personal Travel | Eco | 2138 | 0 | 0 | 0 | 3 | 2 | 0 | 2 | 2 | 3 | 3 | 4 | 4 | 4 | 2 | 0 | 0.0 |
| 3 | satisfied | Female | Loyal Customer | 60 | Personal Travel | Eco | 623 | 0 | 0 | 0 | 3 | 3 | 4 | 3 | 1 | 1 | 0 | 1 | 4 | 1 | 3 | 0 | 0.0 |
| 4 | satisfied | Female | Loyal Customer | 70 | Personal Travel | Eco | 354 | 0 | 0 | 0 | 3 | 4 | 3 | 4 | 2 | 2 | 0 | 2 | 4 | 2 | 5 | 0 | 0.0 |
| 5 | satisfied | Male | Loyal Customer | 30 | Personal Travel | Eco | 1894 | 0 | 0 | 0 | 3 | 2 | 0 | 2 | 2 | 5 | 4 | 5 | 5 | 4 | 2 | 0 | 0.0 |
| 6 | satisfied | Female | Loyal Customer | 66 | Personal Travel | Eco | 227 | 0 | 0 | 0 | 3 | 2 | 5 | 5 | 5 | 5 | 0 | 5 | 5 | 5 | 3 | 17 | 15.0 |
| 7 | satisfied | Male | Loyal Customer | 10 | Personal Travel | Eco | 1812 | 0 | 0 | 0 | 3 | 2 | 0 | 2 | 2 | 3 | 3 | 4 | 5 | 4 | 2 | 0 | 0.0 |
| 8 | satisfied | Female | Loyal Customer | 56 | Personal Travel | Business | 73 | 0 | 0 | 0 | 3 | 5 | 3 | 5 | 4 | 4 | 0 | 1 | 5 | 4 | 4 | 0 | 0.0 |
| 9 | satisfied | Male | Loyal Customer | 22 | Personal Travel | Eco | 1556 | 0 | 0 | 0 | 3 | 2 | 0 | 2 | 2 | 2 | 4 | 5 | 3 | 4 | 2 | 30 | 26.0 |
Last rows
| satisfaction | Gender | Customer Type | Age | Type of Travel | Class | Flight Distance | Seat comfort | Departure/Arrival time convenient | Food and drink | Gate location | Inflight wifi service | Inflight entertainment | Online support | Ease of Online booking | On-board service | Leg room service | Baggage handling | Checkin service | Cleanliness | Online boarding | Departure Delay in Minutes | Arrival Delay in Minutes | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 129870 | satisfied | Female | disloyal Customer | 70 | Personal Travel | Eco | 1674 | 5 | 4 | 5 | 1 | 5 | 5 | 5 | 5 | 3 | 2 | 4 | 5 | 4 | 5 | 54 | 46.0 |
| 129871 | satisfied | Female | disloyal Customer | 35 | Personal Travel | Eco | 3287 | 5 | 4 | 5 | 3 | 2 | 5 | 2 | 2 | 4 | 5 | 4 | 4 | 3 | 2 | 9 | 0.0 |
| 129872 | satisfied | Female | disloyal Customer | 69 | Personal Travel | Eco | 2240 | 5 | 4 | 5 | 3 | 4 | 5 | 4 | 4 | 5 | 4 | 4 | 3 | 4 | 4 | 4 | 0.0 |
| 129873 | satisfied | Female | disloyal Customer | 63 | Personal Travel | Eco | 1942 | 5 | 5 | 4 | 4 | 3 | 4 | 3 | 3 | 5 | 2 | 5 | 3 | 5 | 3 | 7 | NaN |
| 129874 | satisfied | Female | disloyal Customer | 11 | Personal Travel | Eco | 2752 | 5 | 5 | 5 | 2 | 2 | 5 | 2 | 2 | 3 | 5 | 3 | 5 | 4 | 2 | 5 | 0.0 |
| 129875 | satisfied | Female | disloyal Customer | 29 | Personal Travel | Eco | 1731 | 5 | 5 | 5 | 3 | 2 | 5 | 2 | 2 | 3 | 3 | 4 | 4 | 4 | 2 | 0 | 0.0 |
| 129876 | dissatisfied | Male | disloyal Customer | 63 | Personal Travel | Business | 2087 | 2 | 3 | 2 | 4 | 2 | 1 | 1 | 3 | 2 | 3 | 3 | 1 | 2 | 1 | 174 | 172.0 |
| 129877 | dissatisfied | Male | disloyal Customer | 69 | Personal Travel | Eco | 2320 | 3 | 0 | 3 | 3 | 3 | 2 | 2 | 4 | 4 | 3 | 4 | 2 | 3 | 2 | 155 | 163.0 |
| 129878 | dissatisfied | Male | disloyal Customer | 66 | Personal Travel | Eco | 2450 | 3 | 2 | 3 | 2 | 3 | 2 | 2 | 3 | 3 | 2 | 3 | 2 | 1 | 2 | 193 | 205.0 |
| 129879 | dissatisfied | Female | disloyal Customer | 38 | Personal Travel | Eco | 4307 | 3 | 4 | 3 | 3 | 3 | 3 | 3 | 4 | 5 | 5 | 5 | 3 | 3 | 3 | 185 | 186.0 |